Overview
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 158 |
| Missing cells (%) | 0.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 470.7 KiB |
| Average record size in memory | 482.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 8 |
| Categorical | 7 |
| DateTime | 2 |
Alerts
| category is highly overall correlated with price | High correlation |
|---|---|
| price is highly overall correlated with category | High correlation |
| age has 49 (4.9%) missing values | Missing |
| annual_income has 50 (5.0%) missing values | Missing |
| loyalty_score has 59 (5.9%) missing values | Missing |
| transaction_id has unique values | Unique |
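The missing-value alerts can be cross-checked against the overview figures. The following is a minimal sketch that uses only numbers quoted in this report; the variable names are illustrative and not part of the profiler's output.

```python
# Figures copied from the Overview and alert lines of this report.
n_variables = 18
n_observations = 1000
missing_by_column = {"age": 49, "annual_income": 50, "loyalty_score": 59}

# Total number of cells in the table and how many of them are missing.
total_cells = n_variables * n_observations
missing_cells = sum(missing_by_column.values())
missing_cells_pct = 100 * missing_cells / total_cells

# Per-column share of missing values, relative to the number of rows.
missing_pct_by_column = {
    name: 100 * count / n_observations
    for name, count in missing_by_column.items()
}

print(missing_cells)                # reported as "Missing cells": 158
print(round(missing_cells_pct, 1))  # reported as "Missing cells (%)": 0.9
print(missing_pct_by_column)
```

The three columns flagged as Missing account for all 158 missing cells, so no other variable contributes to the total.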
Reproduction
| Analysis started | 2026-02-23 11:06:51.890622 |
|---|---|
| Analysis finished | 2026-02-23 11:07:04.086841 |
| Duration | 12.2 seconds |
| Software version | ydata-profiling v4.18.1 |
| Download configuration | config.json |
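A report of this kind is typically generated from a pandas DataFrame. The sketch below shows one way this might be done; the file name `transactions.csv`, the output path, the report title, and the helper name `build_report` are all hypothetical, and the `config.json` used for this particular run is not reproduced here.

```python
def build_report(csv_path, out_path="report.html"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported inside the function so the definition loads even when the
    # optional dependencies are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parsing the two date columns up front lets the profiler treat them
    # as dates rather than as text (column names taken from this report).
    df = pd.read_csv(csv_path, parse_dates=["purchase_date", "signup_date"])

    report = ProfileReport(df, title="Transactions profile")  # hypothetical title
    report.to_file(out_path)
    return out_path


# Example call (hypothetical input file):
# build_report("transactions.csv")
```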
Variables
transaction_id
Text
Unique
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.7 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1000 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | T00001 |
|---|---|
| 2nd row | T00002 |
| 3rd row | T00003 |
| 4th row | T00004 |
| 5th row | T00005 |
| Value | Count | Frequency (%) |
| t00001 | 1 | 0.1% |
| t00002 | 1 | 0.1% |
| t00003 | 1 | 0.1% |
| t00004 | 1 | 0.1% |
| t00005 | 1 | 0.1% |
| t00006 | 1 | 0.1% |
| t00007 | 1 | 0.1% |
| t00008 | 1 | 0.1% |
| t00009 | 1 | 0.1% |
| t00010 | 1 | 0.1% |
| Other values (990) | 990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2299 | |
| T | 1000 | |
| 1 | 301 | 5.0% |
| 2 | 300 | 5.0% |
| 3 | 300 | 5.0% |
| 4 | 300 | 5.0% |
| 5 | 300 | 5.0% |
| 6 | 300 | 5.0% |
| 7 | 300 | 5.0% |
| 8 | 300 | 5.0% |
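The character counts above are what one would expect if the identifiers are simply `T` followed by a zero-padded five-digit counter from 1 to 1000. That format is an assumption inferred from the sample rows, the fixed length of 6, and the 1000 unique values; the sketch below rebuilds the counts under that assumption and also gives the shares for the two rows whose frequency cell is empty.

```python
from collections import Counter

# Assumed ID format: "T" + five-digit zero-padded counter, 1..1000.
ids = [f"T{i:05d}" for i in range(1, 1001)]

# Count every character across all identifiers.
counts = Counter("".join(ids))
total_chars = sum(counts.values())

for char in ("0", "T", "1", "2"):
    share = 100 * counts[char] / total_chars
    print(char, counts[char], f"{share:.1f}%")
```

Under this assumption, `0` accounts for about 38.3% of all characters and `T` for about 16.7%.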
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2299 | |
| T | 1000 | |
| 1 | 301 | 5.0% |
| 2 | 300 | 5.0% |
| 3 | 300 | 5.0% |
| 4 | 300 | 5.0% |
| 5 | 300 | 5.0% |
| 6 | 300 | 5.0% |
| 7 | 300 | 5.0% |
| 8 | 300 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2299 | |
| T | 1000 | |
| 1 | 301 | 5.0% |
| 2 | 300 | 5.0% |
| 3 | 300 | 5.0% |
| 4 | 300 | 5.0% |
| 5 | 300 | 5.0% |
| 6 | 300 | 5.0% |
| 7 | 300 | 5.0% |
| 8 | 300 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2299 | |
| T | 1000 | |
| 1 | 301 | 5.0% |
| 2 | 300 | 5.0% |
| 3 | 300 | 5.0% |
| 4 | 300 | 5.0% |
| 5 | 300 | 5.0% |
| 6 | 300 | 5.0% |
| 7 | 300 | 5.0% |
| 8 | 300 | 5.0% |
customer_id
Real number (ℝ)
| Distinct | 637 |
|---|---|
| Distinct (%) | 63.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 507.486 |
| Minimum | 1 |
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 52.95 |
| Q1 | 267.75 |
| median | 521.5 |
| Q3 | 744.25 |
| 95-th percentile | 945.15 |
| Maximum | 1000 |
| Range | 999 |
| Interquartile range (IQR) | 476.5 |
Descriptive statistics
| Standard deviation | 286.79851 |
|---|---|
| Coefficient of variation (CV) | 0.5651358 |
| Kurtosis | -1.1686127 |
| Mean | 507.486 |
| Median Absolute Deviation (MAD) | 236.5 |
| Skewness | -0.07015154 |
| Sum | 507486 |
| Variance | 82253.383 |
| Monotonicity | Not monotonic |
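Several of the derived statistics can be recomputed from the others, which is a quick way to confirm how they are defined. The sketch below uses only the customer_id figures quoted above; small differences are expected because the inputs are already rounded.

```python
# Figures copied from the customer_id tables in this report.
mean = 507.486
std = 286.79851
q1, q3 = 267.75, 744.25
minimum, maximum = 1, 1000

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 7))        # reported: 0.5651358
print(round(variance, 2))  # reported: 82253.383
print(iqr, value_range)    # reported: 476.5 and 999
```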
| Value | Count | Frequency (%) |
| 707 | 5 | 0.5% |
| 570 | 5 | 0.5% |
| 642 | 5 | 0.5% |
| 651 | 5 | 0.5% |
| 745 | 5 | 0.5% |
| 359 | 4 | 0.4% |
| 102 | 4 | 0.4% |
| 633 | 4 | 0.4% |
| 478 | 4 | 0.4% |
| 835 | 4 | 0.4% |
| Other values (627) | 955 |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 5 | 2 | |
| 6 | 2 | |
| 7 | 1 | 0.1% |
| 8 | 2 | |
| 9 | 2 | |
| 10 | 1 | 0.1% |
| 12 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1000 | 2 | |
| 996 | 3 | |
| 995 | 2 | |
| 994 | 1 | 0.1% |
| 993 | 2 | |
| 992 | 1 | 0.1% |
| 991 | 1 | 0.1% |
| 989 | 1 | 0.1% |
| 986 | 1 | 0.1% |
| 985 | 1 | 0.1% |
product_id
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.406 |
| Minimum | 1 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 25 |
| median | 50 |
| Q3 | 76 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 29.223538 |
|---|---|
| Coefficient of variation (CV) | 0.57976309 |
| Kurtosis | -1.226461 |
| Mean | 50.406 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 0.0055117714 |
| Sum | 50406 |
| Variance | 854.01518 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 17 | 1.7% |
| 51 | 17 | 1.7% |
| 91 | 16 | 1.6% |
| 85 | 16 | 1.6% |
| 4 | 16 | 1.6% |
| 46 | 16 | 1.6% |
| 11 | 15 | 1.5% |
| 86 | 14 | 1.4% |
| 7 | 14 | 1.4% |
| 74 | 14 | 1.4% |
| Other values (90) | 845 |
| Value | Count | Frequency (%) |
| 1 | 10 | |
| 2 | 9 | |
| 3 | 8 | |
| 4 | 16 | |
| 5 | 9 | |
| 6 | 8 | |
| 7 | 14 | |
| 8 | 10 | |
| 9 | 10 | |
| 10 | 12 |
| Value | Count | Frequency (%) |
| 100 | 13 | |
| 99 | 7 | |
| 98 | 12 | |
| 97 | 9 | |
| 96 | 10 | |
| 95 | 7 | |
| 94 | 10 | |
| 93 | 12 | |
| 92 | 10 | |
| 91 | 16 |
quantity
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3 | 261 | |
| 2 | 250 | |
| 4 | 249 | |
| 1 | 240 |
purchase_date
Date
| Distinct | 599 |
|---|---|
| Distinct (%) | 59.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Minimum | 2022-01-01 00:00:00 |
| Maximum | 2024-06-17 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
purchased
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 695 | |
| 0 | 305 |
age
Real number (ℝ)
Missing
| Distinct | 57 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 49 |
| Missing (%) | 4.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.361725 |
| Minimum | 18 |
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 32 |
| median | 47 |
| Q3 | 61 |
| 95-th percentile | 73 |
| Maximum | 74 |
| Range | 56 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 16.711785 |
|---|---|
| Coefficient of variation (CV) | 0.36046513 |
| Kurtosis | -1.1649677 |
| Mean | 46.361725 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.0012251337 |
| Sum | 44090 |
| Variance | 279.28375 |
| Monotonicity | Not monotonic |
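For a variable with missing values, the mean and the distinct percentage appear to be computed over the non-missing rows only. The sketch below checks this against the age figures quoted above; it is a consistency check on the reported numbers, not a description of the profiler's internals.

```python
# Figures copied from the age tables in this report.
n_rows = 1000
n_missing = 49
total = 44090   # reported "Sum"
distinct = 57   # reported "Distinct"

# Only rows with a recorded age contribute to the statistics.
n_present = n_rows - n_missing

mean = total / n_present
distinct_pct = 100 * distinct / n_present

print(n_present)               # 951
print(round(mean, 6))          # reported "Mean": 46.361725
print(round(distinct_pct, 1))  # reported "Distinct (%)": 6.0
```

Dividing by all 1000 rows instead would give a mean of 44.09 and a distinct share of 5.7%, neither of which matches the report.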
| Value | Count | Frequency (%) |
| 74 | 30 | 3.0% |
| 18 | 27 | 2.7% |
| 26 | 26 | 2.6% |
| 34 | 25 | 2.5% |
| 40 | 25 | 2.5% |
| 51 | 25 | 2.5% |
| 41 | 24 | 2.4% |
| 49 | 24 | 2.4% |
| 50 | 23 | 2.3% |
| 56 | 23 | 2.3% |
| Other values (47) | 699 | |
| (Missing) | 49 | 4.9% |
| Value | Count | Frequency (%) |
| 18 | 27 | |
| 19 | 15 | |
| 20 | 11 | |
| 21 | 19 | |
| 22 | 10 | 1.0% |
| 23 | 20 | |
| 24 | 10 | 1.0% |
| 25 | 16 | |
| 26 | 26 | |
| 27 | 21 |
| Value | Count | Frequency (%) |
| 74 | 30 | |
| 73 | 21 | |
| 72 | 14 | |
| 71 | 17 | |
| 70 | 15 | |
| 69 | 18 | |
| 68 | 16 | |
| 67 | 21 | |
| 66 | 19 | |
| 65 | 21 |
gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.6 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.941 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other |
|---|---|
| 2nd row | Other |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Other |
Common Values
| Value | Count | Frequency (%) |
| Male | 364 | |
| Other | 331 | |
| Female | 305 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 364 | |
| other | 331 | |
| female | 305 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1305 | |
| a | 669 | |
| l | 669 | |
| M | 364 | 7.4% |
| O | 331 | 6.7% |
| t | 331 | 6.7% |
| h | 331 | 6.7% |
| r | 331 | 6.7% |
| F | 305 | 6.2% |
| m | 305 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4941 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1305 | |
| a | 669 | |
| l | 669 | |
| M | 364 | 7.4% |
| O | 331 | 6.7% |
| t | 331 | 6.7% |
| h | 331 | 6.7% |
| r | 331 | 6.7% |
| F | 305 | 6.2% |
| m | 305 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4941 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1305 | |
| a | 669 | |
| l | 669 | |
| M | 364 | 7.4% |
| O | 331 | 6.7% |
| t | 331 | 6.7% |
| h | 331 | 6.7% |
| r | 331 | 6.7% |
| F | 305 | 6.2% |
| m | 305 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4941 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1305 | |
| a | 669 | |
| l | 669 | |
| M | 364 | 7.4% |
| O | 331 | 6.7% |
| t | 331 | 6.7% |
| h | 331 | 6.7% |
| r | 331 | 6.7% |
| F | 305 | 6.2% |
| m | 305 | 6.2% |
city
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.2 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.515 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rajkot |
|---|---|
| 2nd row | Vadodara |
| 3rd row | Mumbai |
| 4th row | Mumbai |
| 5th row | Delhi |
Common Values
| Value | Count | Frequency (%) |
| Rajkot | 179 | |
| Delhi | 173 | |
| Ahmedabad | 173 | |
| Mumbai | 170 | |
| Vadodara | 158 | |
| Surat | 147 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| rajkot | 179 | |
| delhi | 173 | |
| ahmedabad | 173 | |
| mumbai | 170 | |
| vadodara | 158 | |
| surat | 147 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1316 | |
| d | 662 | 10.2% |
| e | 346 | 5.3% |
| h | 346 | 5.3% |
| m | 343 | 5.3% |
| b | 343 | 5.3% |
| i | 343 | 5.3% |
| o | 337 | 5.2% |
| t | 326 | 5.0% |
| u | 317 | 4.9% |
| Other values (10) | 1836 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6515 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1316 | |
| d | 662 | 10.2% |
| e | 346 | 5.3% |
| h | 346 | 5.3% |
| m | 343 | 5.3% |
| b | 343 | 5.3% |
| i | 343 | 5.3% |
| o | 337 | 5.2% |
| t | 326 | 5.0% |
| u | 317 | 4.9% |
| Other values (10) | 1836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6515 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1316 | |
| d | 662 | 10.2% |
| e | 346 | 5.3% |
| h | 346 | 5.3% |
| m | 343 | 5.3% |
| b | 343 | 5.3% |
| i | 343 | 5.3% |
| o | 337 | 5.2% |
| t | 326 | 5.0% |
| u | 317 | 4.9% |
| Other values (10) | 1836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6515 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1316 | |
| d | 662 | 10.2% |
| e | 346 | 5.3% |
| h | 346 | 5.3% |
| m | 343 | 5.3% |
| b | 343 | 5.3% |
| i | 343 | 5.3% |
| o | 337 | 5.2% |
| t | 326 | 5.0% |
| u | 317 | 4.9% |
| Other values (10) | 1836 |
annual_income
Real number (ℝ)
Missing
| Distinct | 606 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 50 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1223201.7 |
| Minimum | 122258 |
| Maximum | 2499544 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 122258 |
|---|---|
| 5-th percentile | 232582.35 |
| Q1 | 635492 |
| median | 1157025 |
| Q3 | 1810788 |
| 95-th percentile | 2310970 |
| Maximum | 2499544 |
| Range | 2377286 |
| Interquartile range (IQR) | 1175296 |
Descriptive statistics
| Standard deviation | 673620.26 |
|---|---|
| Coefficient of variation (CV) | 0.55070253 |
| Kurtosis | -1.2103419 |
| Mean | 1223201.7 |
| Median Absolute Deviation (MAD) | 584766 |
| Skewness | 0.12480239 |
| Sum | 1.1620416 × 10⁹ |
| Variance | 4.5376426 × 10¹¹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1845114 | 5 | 0.5% |
| 159565 | 5 | 0.5% |
| 293990 | 5 | 0.5% |
| 1800299 | 5 | 0.5% |
| 2129251 | 4 | 0.4% |
| 1492214 | 4 | 0.4% |
| 1512532 | 4 | 0.4% |
| 948324 | 4 | 0.4% |
| 1912650 | 4 | 0.4% |
| 1841881 | 4 | 0.4% |
| Other values (596) | 906 | |
| (Missing) | 50 | 5.0% |
| Value | Count | Frequency (%) |
| 122258 | 3 | |
| 122319 | 1 | 0.1% |
| 125053 | 2 | 0.2% |
| 129898 | 2 | 0.2% |
| 130809 | 1 | 0.1% |
| 132422 | 2 | 0.2% |
| 133557 | 1 | 0.1% |
| 140072 | 1 | 0.1% |
| 155020 | 2 | 0.2% |
| 159565 | 5 |
| Value | Count | Frequency (%) |
| 2499544 | 1 | |
| 2491733 | 1 | |
| 2480623 | 2 | |
| 2474670 | 2 | |
| 2458777 | 2 | |
| 2454193 | 2 | |
| 2451282 | 2 | |
| 2434644 | 2 | |
| 2422760 | 2 | |
| 2422524 | 2 |
signup_date
Date
| Distinct | 547 |
|---|---|
| Distinct (%) | 54.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Minimum | 2018-01-02 00:00:00 |
| Maximum | 2023-06-22 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
loyalty_score
Real number (ℝ)
Missing
| Distinct | 582 |
|---|---|
| Distinct (%) | 61.8% |
| Missing | 59 |
| Missing (%) | 5.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.704166 |
| Minimum | 1.4 |
| Maximum | 99.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1.4 |
|---|---|
| 5-th percentile | 6.23 |
| Q1 | 26.53 |
| median | 49.83 |
| Q3 | 76.28 |
| 95-th percentile | 93.88 |
| Maximum | 99.73 |
| Range | 98.33 |
| Interquartile range (IQR) | 49.75 |
Descriptive statistics
| Standard deviation | 28.327537 |
|---|---|
| Coefficient of variation (CV) | 0.55868264 |
| Kurtosis | -1.1759913 |
| Mean | 50.704166 |
| Median Absolute Deviation (MAD) | 24.9 |
| Skewness | 0.013455043 |
| Sum | 47712.62 |
| Variance | 802.44938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 89.23 | 5 | 0.5% |
| 56.05 | 5 | 0.5% |
| 71.53 | 5 | 0.5% |
| 39.51 | 5 | 0.5% |
| 76.45 | 5 | 0.5% |
| 88.66 | 5 | 0.5% |
| 37.58 | 5 | 0.5% |
| 45.07 | 4 | 0.4% |
| 91.11 | 4 | 0.4% |
| 28.2 | 4 | 0.4% |
| Other values (572) | 894 | |
| (Missing) | 59 | 5.9% |
| Value | Count | Frequency (%) |
| 1.4 | 1 | 0.1% |
| 1.62 | 1 | 0.1% |
| 1.71 | 2 | |
| 1.85 | 1 | 0.1% |
| 2.36 | 1 | 0.1% |
| 2.38 | 3 | |
| 2.39 | 2 | |
| 2.43 | 1 | 0.1% |
| 2.47 | 1 | 0.1% |
| 2.98 | 2 |
| Value | Count | Frequency (%) |
| 99.73 | 1 | 0.1% |
| 99.52 | 1 | 0.1% |
| 99.36 | 1 | 0.1% |
| 99.05 | 1 | 0.1% |
| 98.82 | 1 | 0.1% |
| 98.78 | 2 | |
| 98.61 | 1 | 0.1% |
| 98.53 | 2 | |
| 98.45 | 3 | |
| 98.44 | 2 |
is_active
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 799 | |
| 0 | 201 | 20.1% |
category
Categorical
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 62.0 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 6.389 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Groceries |
|---|---|
| 2nd row | Books |
| 3rd row | Electronics |
| 4th row | Clothing |
| 5th row | Home |
Common Values
| Value | Count | Frequency (%) |
| Toys | 250 | |
| Home | 197 | |
| Groceries | 186 | |
| Clothing | 154 | |
| Books | 108 | |
| Electronics | 105 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| toys | 250 | |
| home | 197 | |
| groceries | 186 | |
| clothing | 154 | |
| books | 108 | |
| electronics | 105 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1108 | |
| e | 674 | |
| s | 649 | |
| r | 477 | 7.5% |
| i | 445 | 7.0% |
| c | 396 | 6.2% |
| t | 259 | 4.1% |
| n | 259 | 4.1% |
| l | 259 | 4.1% |
| T | 250 | 3.9% |
| Other values (10) | 1613 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6389 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 1108 | |
| e | 674 | |
| s | 649 | |
| r | 477 | 7.5% |
| i | 445 | 7.0% |
| c | 396 | 6.2% |
| t | 259 | 4.1% |
| n | 259 | 4.1% |
| l | 259 | 4.1% |
| T | 250 | 3.9% |
| Other values (10) | 1613 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6389 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 1108 | |
| e | 674 | |
| s | 649 | |
| r | 477 | 7.5% |
| i | 445 | 7.0% |
| c | 396 | 6.2% |
| t | 259 | 4.1% |
| n | 259 | 4.1% |
| l | 259 | 4.1% |
| T | 250 | 3.9% |
| Other values (10) | 1613 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6389 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 1108 | |
| e | 674 | |
| s | 649 | |
| r | 477 | 7.5% |
| i | 445 | 7.0% |
| c | 396 | 6.2% |
| t | 259 | 4.1% |
| n | 259 | 4.1% |
| l | 259 | 4.1% |
| T | 250 | 3.9% |
| Other values (10) | 1613 |
price
Real number (ℝ)
High correlation
| Distinct | 100 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7988.055 |
| Minimum | 174.87 |
| Maximum | 73715.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 174.87 |
|---|---|
| 5-th percentile | 579.59 |
| Q1 | 1250.97 |
| median | 2617.55 |
| Q3 | 5919.6325 |
| 95-th percentile | 35068.14 |
| Maximum | 73715.1 |
| Range | 73540.23 |
| Interquartile range (IQR) | 4668.6625 |
Descriptive statistics
| Standard deviation | 12775.886 |
|---|---|
| Coefficient of variation (CV) | 1.5993739 |
| Kurtosis | 8.1173278 |
| Mean | 7988.055 |
| Median Absolute Deviation (MAD) | 1568.3 |
| Skewness | 2.7350137 |
| Sum | 7988055 |
| Variance | 1.6322327 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1049.25 | 17 | 1.7% |
| 1250.97 | 17 | 1.7% |
| 579.59 | 16 | 1.6% |
| 28165.65 | 16 | 1.6% |
| 1696.13 | 16 | 1.6% |
| 5893.35 | 16 | 1.6% |
| 1509.99 | 15 | 1.5% |
| 5620.98 | 14 | 1.4% |
| 1388.41 | 14 | 1.4% |
| 1192.1 | 14 | 1.4% |
| Other values (90) | 845 |
| Value | Count | Frequency (%) |
| 174.87 | 5 | 0.5% |
| 498.06 | 10 | |
| 500.57 | 11 | |
| 509.15 | 12 | |
| 529.99 | 9 | |
| 579.59 | 16 | |
| 596.84 | 12 | |
| 630.48 | 10 | |
| 663.71 | 10 | |
| 698.7 | 4 | 0.4% |
| Value | Count | Frequency (%) |
| 73715.1 | 6 | 0.6% |
| 60872.09 | 12 | |
| 57496.3 | 8 | |
| 43365.65 | 7 | |
| 42704.08 | 11 | |
| 35068.14 | 7 | |
| 28480.84 | 8 | |
| 28275.9 | 9 | |
| 28165.65 | 16 | |
| 23609.45 | 9 |
discount
Categorical
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.573 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 5 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
| 5 | 261 | |
| 15 | 242 | |
| 10 | 173 | |
| 0 | 166 | |
| 20 | 158 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5 | 261 | |
| 15 | 242 | |
| 10 | 173 | |
| 0 | 166 | |
| 20 | 158 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 503 | |
| 0 | 497 | |
| 1 | 415 | |
| 2 | 158 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1573 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 5 | 503 | |
| 0 | 497 | |
| 1 | 415 | |
| 2 | 158 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1573 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 5 | 503 | |
| 0 | 497 | |
| 1 | 415 | |
| 2 | 158 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1573 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 5 | 503 | |
| 0 | 497 | |
| 1 | 415 | |
| 2 | 158 | 10.0% |
stock_qty
Real number (ℝ)
| Distinct | 87 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 253.195 |
| Minimum | 7 |
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 140 |
| median | 246 |
| Q3 | 388 |
| 95-th percentile | 463 |
| Maximum | 497 |
| Range | 490 |
| Interquartile range (IQR) | 248 |
Descriptive statistics
| Standard deviation | 139.21431 |
|---|---|
| Coefficient of variation (CV) | 0.5498304 |
| Kurtosis | -1.137067 |
| Mean | 253.195 |
| Median Absolute Deviation (MAD) | 113 |
| Skewness | 0.050993246 |
| Sum | 253195 |
| Variance | 19380.624 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 166 | 29 | 2.9% |
| 79 | 28 | 2.8% |
| 408 | 26 | 2.6% |
| 163 | 24 | 2.4% |
| 29 | 24 | 2.4% |
| 433 | 21 | 2.1% |
| 127 | 21 | 2.1% |
| 273 | 19 | 1.9% |
| 359 | 18 | 1.8% |
| 441 | 18 | 1.8% |
| Other values (77) | 772 |
| Value | Count | Frequency (%) |
| 7 | 8 | 0.8% |
| 11 | 13 | |
| 12 | 10 | 1.0% |
| 13 | 9 | 0.9% |
| 15 | 7 | 0.7% |
| 29 | 24 | |
| 30 | 5 | 0.5% |
| 40 | 9 | 0.9% |
| 60 | 9 | 0.9% |
| 79 | 28 |
| Value | Count | Frequency (%) |
| 497 | 15 | |
| 487 | 12 | |
| 478 | 8 | |
| 470 | 7 | 0.7% |
| 469 | 7 | 0.7% |
| 463 | 10 | |
| 456 | 7 | 0.7% |
| 454 | 9 | |
| 445 | 10 | |
| 441 | 18 |
rating
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7413 |
| Minimum | 1 |
| Maximum | 4.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.1 |
| Q1 | 1.7 |
| median | 2.6 |
| Q3 | 3.3 |
| 95-th percentile | 4.8 |
| Maximum | 4.9 |
| Range | 3.9 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.1616341 |
|---|---|
| Coefficient of variation (CV) | 0.42375299 |
| Kurtosis | -0.94774691 |
| Mean | 2.7413 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | 0.39909837 |
| Sum | 2741.3 |
| Variance | 1.3493937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.2 | 68 | 6.8% |
| 2.2 | 53 | 5.3% |
| 1.5 | 52 | 5.2% |
| 2.9 | 51 | 5.1% |
| 1.6 | 46 | 4.6% |
| 1.9 | 43 | 4.3% |
| 4.8 | 43 | 4.3% |
| 2.8 | 41 | 4.1% |
| 3 | 41 | 4.1% |
| 1.4 | 37 | 3.7% |
| Other values (24) | 525 |
| Value | Count | Frequency (%) |
| 1 | 29 | |
| 1.1 | 27 | |
| 1.2 | 23 | |
| 1.3 | 19 | 1.9% |
| 1.4 | 37 | |
| 1.5 | 52 | |
| 1.6 | 46 | |
| 1.7 | 24 | |
| 1.8 | 20 | 2.0% |
| 1.9 | 43 |
| Value | Count | Frequency (%) |
| 4.9 | 35 | |
| 4.8 | 43 | |
| 4.7 | 17 | 1.7% |
| 4.6 | 26 | |
| 4.5 | 9 | 0.9% |
| 4.4 | 35 | |
| 4.3 | 16 | 1.6% |
| 4 | 33 | |
| 3.9 | 8 | 0.8% |
| 3.6 | 10 | 1.0% |
Interactions
Correlations
| | age | annual_income | category | city | customer_id | discount | gender | is_active | loyalty_score | price | product_id | purchased | quantity | rating | stock_qty |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | -0.025 | 0.026 | 0.113 | 0.067 | 0.000 | 0.118 | 0.070 | 0.092 | -0.034 | 0.030 | 0.029 | 0.000 | 0.002 | -0.003 |
| annual_income | -0.025 | 1.000 | 0.000 | 0.111 | -0.007 | 0.000 | 0.124 | 0.138 | 0.041 | 0.000 | -0.046 | 0.066 | 0.000 | -0.005 | -0.023 |
| category | 0.026 | 0.000 | 1.000 | 0.000 | 0.035 | 0.209 | 0.030 | 0.000 | 0.045 | 0.508 | 0.254 | 0.000 | 0.000 | 0.270 | 0.260 |
| city | 0.113 | 0.111 | 0.000 | 1.000 | 0.094 | 0.024 | 0.068 | 0.043 | 0.101 | 0.009 | 0.029 | 0.007 | 0.000 | 0.029 | 0.000 |
| customer_id | 0.067 | -0.007 | 0.035 | 0.094 | 1.000 | 0.030 | 0.102 | 0.045 | 0.005 | 0.004 | 0.004 | 0.025 | 0.000 | 0.007 | -0.029 |
| discount | 0.000 | 0.000 | 0.209 | 0.024 | 0.030 | 1.000 | 0.051 | 0.032 | 0.064 | 0.248 | 0.253 | 0.000 | 0.000 | 0.298 | 0.254 |
| gender | 0.118 | 0.124 | 0.030 | 0.068 | 0.102 | 0.051 | 1.000 | 0.033 | 0.095 | 0.000 | 0.000 | 0.051 | 0.000 | 0.000 | 0.000 |
| is_active | 0.070 | 0.138 | 0.000 | 0.043 | 0.045 | 0.032 | 0.033 | 1.000 | 0.193 | 0.000 | 0.000 | 0.039 | 0.000 | 0.060 | 0.027 |
| loyalty_score | 0.092 | 0.041 | 0.045 | 0.101 | 0.005 | 0.064 | 0.095 | 0.193 | 1.000 | -0.037 | 0.053 | 0.000 | 0.061 | 0.012 | -0.011 |
| price | -0.034 | 0.000 | 0.508 | 0.009 | 0.004 | 0.248 | 0.000 | 0.000 | -0.037 | 1.000 | -0.020 | 0.014 | 0.000 | -0.003 | 0.024 |
| product_id | 0.030 | -0.046 | 0.254 | 0.029 | 0.004 | 0.253 | 0.000 | 0.000 | 0.053 | -0.020 | 1.000 | 0.122 | 0.000 | -0.194 | 0.151 |
| purchased | 0.029 | 0.066 | 0.000 | 0.007 | 0.025 | 0.000 | 0.051 | 0.039 | 0.000 | 0.014 | 0.122 | 1.000 | 0.000 | 0.000 | 0.000 |
| quantity | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.061 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| rating | 0.002 | -0.005 | 0.270 | 0.029 | 0.007 | 0.298 | 0.000 | 0.060 | 0.012 | -0.003 | -0.194 | 0.000 | 0.000 | 1.000 | 0.056 |
| stock_qty | -0.003 | -0.023 | 0.260 | 0.000 | -0.029 | 0.254 | 0.000 | 0.027 | -0.011 | 0.024 | 0.151 | 0.000 | 0.000 | 0.056 | 1.000 |
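The two High correlation alerts correspond to the only off-diagonal entry in the matrix whose magnitude exceeds 0.5: category and price, at 0.508. A minimal sketch of that check follows. The 0.5 threshold is an assumption about the profiler's configuration rather than something stated in this report, and only a handful of the larger pairs from the matrix are copied in.

```python
# Selected off-diagonal entries copied from the correlation matrix above.
correlations = {
    ("category", "price"): 0.508,
    ("discount", "rating"): 0.298,
    ("category", "rating"): 0.270,
    ("category", "stock_qty"): 0.260,
    ("product_id", "rating"): -0.194,
    ("is_active", "loyalty_score"): 0.193,
}

# Assumed alert threshold; not taken from this report.
THRESHOLD = 0.5

# Keep the pairs whose absolute correlation reaches the threshold.
flagged = [
    pair for pair, value in correlations.items() if abs(value) >= THRESHOLD
]

print(flagged)
```

Because the matrix is symmetric, one flagged pair produces one alert for each of the two variables involved.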
Missing values
Sample
First rows
| | transaction_id | customer_id | product_id | quantity | purchase_date | purchased | age | gender | city | annual_income | signup_date | loyalty_score | is_active | category | price | discount | stock_qty | rating |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | T00001 | 325 | 11 | 4 | 2023-12-24 | 0 | 44.0 | Other | Rajkot | 864504.0 | 2018-07-10 | 46.90 | 1 | Groceries | 1509.99 | 5 | 156 | 2.5 |
| 1 | T00002 | 580 | 24 | 4 | 2022-01-25 | 1 | 52.0 | Other | Vadodara | NaN | 2020-04-11 | NaN | 1 | Books | 2901.81 | 0 | 436 | 3.3 |
| 2 | T00003 | 343 | 76 | 2 | 2023-06-12 | 1 | 26.0 | Male | Mumbai | 266007.0 | 2022-05-28 | 39.17 | 1 | Electronics | 23392.87 | 0 | 186 | 1.6 |
| 3 | T00004 | 570 | 22 | 2 | 2022-07-01 | 0 | 45.0 | Male | Mumbai | NaN | 2020-08-17 | 71.53 | 1 | Clothing | 2891.78 | 5 | 169 | 1.9 |
| 4 | T00005 | 645 | 52 | 1 | 2023-05-14 | 0 | 26.0 | Other | Delhi | 1076659.0 | 2023-04-03 | 20.48 | 0 | Home | 15124.37 | 5 | 237 | 1.4 |
| 5 | T00006 | 595 | 30 | 4 | 2024-04-20 | 1 | 63.0 | Female | Delhi | 321927.0 | 2023-04-22 | 46.33 | 1 | Groceries | 1889.88 | 15 | 487 | 2.0 |
| 6 | T00007 | 346 | 34 | 3 | 2023-10-03 | 0 | 20.0 | Female | Mumbai | 1670159.0 | 2022-03-28 | 4.26 | 1 | Clothing | 2850.59 | 5 | 300 | 2.8 |
| 7 | T00008 | 359 | 38 | 2 | 2022-02-11 | 0 | 26.0 | Other | Surat | 1512532.0 | 2019-06-12 | 77.57 | 1 | Home | 5816.79 | 5 | 433 | 4.9 |
| 8 | T00009 | 625 | 88 | 3 | 2023-04-26 | 1 | 56.0 | Female | Surat | 964925.0 | 2023-06-15 | 96.19 | 1 | Toys | 2144.89 | 15 | 258 | 4.7 |
| 9 | T00010 | 371 | 70 | 1 | 2024-04-27 | 1 | 70.0 | Female | Ahmedabad | 1330870.0 | 2019-01-20 | 17.74 | 1 | Clothing | 2516.66 | 15 | 15 | 2.2 |
Last rows
| | transaction_id | customer_id | product_id | quantity | purchase_date | purchased | age | gender | city | annual_income | signup_date | loyalty_score | is_active | category | price | discount | stock_qty | rating |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | T00991 | 889 | 46 | 1 | 2022-08-30 | 1 | 19.0 | Female | Rajkot | 1649449.0 | 2020-11-11 | 26.55 | 1 | Toys | 5893.35 | 20 | 336 | 1.0 |
| 991 | T00992 | 316 | 32 | 4 | 2022-10-21 | 0 | 18.0 | Male | Ahmedabad | 621451.0 | 2020-07-04 | 17.28 | 1 | Home | 21526.00 | 0 | 402 | 2.6 |
| 992 | T00993 | 569 | 37 | 2 | 2023-07-17 | 0 | NaN | Other | Mumbai | 125053.0 | 2019-11-04 | 15.59 | 0 | Electronics | 42704.08 | 5 | 224 | 4.4 |
| 993 | T00994 | 126 | 24 | 3 | 2022-09-23 | 1 | 22.0 | Female | Vadodara | 863471.0 | 2019-01-28 | 41.31 | 1 | Books | 2901.81 | 0 | 436 | 3.3 |
| 994 | T00995 | 471 | 93 | 2 | 2022-04-30 | 1 | 50.0 | Other | Delhi | 1297133.0 | 2022-07-31 | 98.82 | 1 | Groceries | 596.84 | 5 | 225 | 3.2 |
| 995 | T00996 | 206 | 85 | 1 | 2023-12-13 | 0 | 63.0 | Other | Mumbai | 1960377.0 | 2022-10-12 | 87.84 | 1 | Electronics | 28165.65 | 5 | 408 | 4.3 |
| 996 | T00997 | 722 | 36 | 4 | 2022-09-05 | 1 | 46.0 | Female | Mumbai | 639527.0 | 2018-09-27 | 4.40 | 0 | Home | 22807.24 | 15 | 316 | 4.6 |
| 997 | T00998 | 220 | 40 | 2 | 2022-03-11 | 1 | 70.0 | Female | Rajkot | 1083923.0 | 2018-08-15 | 84.77 | 1 | Toys | 2580.47 | 0 | 234 | 3.2 |
| 998 | T00999 | 851 | 40 | 1 | 2024-05-02 | 0 | 61.0 | Female | Rajkot | 1006773.0 | 2021-06-07 | 30.94 | 1 | Toys | 2580.47 | 0 | 234 | 3.2 |
| 999 | T01000 | 508 | 73 | 3 | 2022-02-26 | 1 | 66.0 | Female | Delhi | 216105.0 | 2019-03-25 | NaN | 1 | Groceries | 1695.89 | 15 | 454 | 4.5 |